Results 1 - 2 of 2
1.
2022 30th European Signal Processing Conference (EUSIPCO 2022): 135-139, 2022.
Article in English | Web of Science | ID: covidwho-2310918

ABSTRACT

Automated audio systems, such as speech emotion recognition, can benefit from the ability to operate on speech originating in another room. No research has yet examined the effectiveness of such systems when the sound source is in a different room than the target system and the sound must travel between rooms through the wall. Recent advances in room-impulse-response generators enable large-scale simulation of audio sources in adjacent rooms and their integration into a training dataset, which improves the performance of data-driven methods such as deep learning. This paper presents the first evaluation of multiroom speech emotion recognition systems. The isolation policies enacted during COVID-19 left many isolated individuals suffering emotional difficulties, a setting in which such capabilities would be very beneficial. We train with and without an audio simulation generator and compare the results of three different models on real data recorded in a multiroom audio scene. We show that models trained without the new generator perform poorly when presented with multiroom data, and that augmentation using the new generator improves the performance of all three models. Our results demonstrate the advantage of using such a generator. Furthermore, testing with two different deep learning architectures shows that the generator improves the results independently of the architecture.
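The augmentation described in the abstract amounts to convolving clean emotional-speech recordings with simulated multiroom room impulse responses before training. The following is a minimal sketch of that general idea in Python; the RIR array, file names, and scaling choice are hypothetical placeholders, not the paper's generator or dataset.

```python
# Sketch: RIR-based augmentation of a training utterance, assuming a
# precomputed (simulated) multiroom impulse response is available.
import numpy as np
from scipy.signal import fftconvolve

def augment_with_rir(speech: np.ndarray, rir: np.ndarray) -> np.ndarray:
    """Convolve a clean utterance with a room impulse response and
    rescale so the augmented clip keeps the original peak level."""
    wet = fftconvolve(speech, rir, mode="full")[: len(speech)]
    peak = np.max(np.abs(wet))
    if peak > 0:
        wet = wet * (np.max(np.abs(speech)) / peak)
    return wet.astype(speech.dtype)

if __name__ == "__main__":
    # Placeholder inputs: a mono utterance and a simulated adjacent-room RIR.
    speech = np.load("clean_utterance.npy")
    rir = np.load("multiroom_rir.npy")
    augmented = augment_with_rir(speech, rir)
    np.save("augmented_utterance.npy", augmented)  # add to the training set
```

In this sketch each clean clip would be paired with one or more simulated RIRs, so the training set grows to cover through-wall acoustic conditions without new recordings.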

2.
30th European Signal Processing Conference, EUSIPCO 2022; 2022-August: 135-139, 2022.
Article in English | Scopus | ID: covidwho-2102621

